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1 Introduction 

The ferromagnetic Ising model on finite graph G= {V,E) at inverse temperature fi >0 with 
external field /; 6 R is defined by the Gibbs measure 

PG./i.A((^) = a6{-l, + lf, (1.1) 

with Hamiltonian 

Hcjiio) = - E E (1-2) 

ijeE iev 

and partition function 

Zc.p.h= E ( 13 ) 

(Te{-l.l}^ 

The central computational challenge associated with the measure is to estimate 
certain moments of physically relevant random variables. Chief amongst such moments are 
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the two-point correlation function and susceptibility. While the problem is simple to pose, 
studying such quantities on large graphs is typically a computationally demanding task, and 
it is therefore not surprising that the Markov-chain Monte Carlo method is one of the most 
common approaches employed. This computational intractability can in fact be made precise 
in the language of complexity theory. It was recently established ll42l that computing the 
susceptibility of the Ising model is #P-hard, and the #P-hardness of the two-point correlation 
function then follows as an immediate corollary. Various #P-hardness results are also known 
for the Ising partition function I23lll3l l3l. 

In order for a particular Markov chain to provide efficient estimators, it is necessary 
that it converges rapidly to stationarity. Consequently, in addition to the single-spin Glauber 
process 1^ . which has a direct physical interpretation, a host of more sophisticated pro¬ 
cesses have been devised, with the aim of improving the efficiency of the resulting estima¬ 
tors. These processes typically augment, or replace, the spin measure in) with a partic¬ 
ular graphical representation. For example. Sweeny l43l proposed studying the zero-field 
Potts model using the single-bond Glauber process for the Fortuin-Kasteleyn model mi 
mi, while the Swendsen-Wang (SW) process simulates a natural coupling llT4l of the 
Fortuin-Kasteleyn model and zero-field Potts model. Similarly, Jerrum and Sinclair 1231 
studied a single-bond Metropolis process for the high-temperature representation of the 
Ising model in a strictly positive field. Prokof’ev and Svistunov ll^ also considered a space 
of Ising high temperature graphs, however their wonn process applies to the case of zero 
field, and uses a novel choice of local moves. Given this abundance of Markov-chain Monte 
Carlo algorithms for the Ising model, one naturally seeks to understand and compare their 
efficiencies. 

A key quantity for characterizing the rate of convergence of reversible Markov chains 
is the relaxation time. For a reversible finite Markov chain with transition matrix P, let A* 
denote the maximum absolute value of the non-trivial eigenvalues of P. The relaxation time 
of P is then simply the reciprocal of the (absolute) spectral gap 
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(1.4) 


Each of the abovementioned types of process (single-spin Glauber, Swendsen-Wang etc), 
can be applied to the Ising model on any finite graph. Let be a given infinite family of 
finite graphs. For a given type of process, and a given choice of Ising parameters j5,h, for 
each G e we can consider the corresponding process for the Ising model on G with 
parameters jS,/i. For each G E ^, we then have a corresponding value of fjei. If the map 
G I—)■ trei can be bounded above by a polynomial in |y(G) |, one says that type of process is 
rapidly mixing on for the given choice of parameters j6, h. Otherwise, the mixing is said 
to be torpid. 

There is a vast literature discussing the mixing of the Ising Glauber process; see for 
example the survey ll3Tl . It has recently been shown I35l that if (A — 1) tanh j5 < 1, then the 
Ising Glauber process is rapidly mixing for all graphs of maximum degree A. This result is 
tight in the sense that if {A — l)tanhj6 > 1, then, with high probability, the mixing of the 
zero-field Ising Glauber process on random A-regular graphs is torpid n~6]l . For finite boxes 
in Z^, the zero-field Glauber process is rapidly mixing above the critical temperature 11341 . 
and at the critical temperature ll32l . but torpid below the critical temperature (6l. The same 
behaviour is known to occur in zero field on the complete graph, where the mixing is very 
well understood im at all temperatures. 

The Swendsen-Wang process has also been the subject of significant study. We focus 
on results for the Ising case. Mixing on the complete graph is very well understood iTimi 
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at all temperatures. For graphs with bounded maximum degree, rapid mixing was estab¬ 
lished for all sufficiently high temperatures in fS). More recently, it has been shown that 
for such graphs, rapid mixing of the single-site Glauber process implies rapid mixing of the 
Swendsen-Wang process I45II46II . In particular, given the single-site Glauber results men¬ 
tioned above, this established rapid mixing of the Swendsen-Wang process on finite boxes 
in 1? for all temperatures at and above the critical temperature. Comparison results between 
the Swendsen-Wang process and the single-bond Glauber process for the Fortuin-Kasteleyn 
model have also been recently established BM, which show that the single-bond process 
is rapidly mixing iff the Swendsen-Wang process is rapidly mixing. As a consequence, by 
exploiting the duality of the Fortuin-Kasteleyn model, this established rapid mixing of the 
Swendsen-Wang process on boxes in 1? at all temperatures below the critical temperature. 

Currently, perhaps the best understood of the above mentioned processes, however, is 
the Jerrum-Sinclair-Ising (JSI) process ||2^ . which is known to be rapidly mixing on all 
graphs, at all temperatures, provided the field is strictly positive. Specifically: 

Theorem 1.1 (Jerrum-Sinclair 1231141 H i. The relaxation time of the JSI process on any finite 
connected graph with m vertices, at any temperature, and in afield h> 0 satisfies 



where p = tanh(/r). 

Remark 1.2. The divergence of the upper bound for as p ^ 0 is to be expected, given 

that the process is not irreducible when h = p = 0. 

As a consequence of Theorem | 1. 1 [ in I23II afully-polynomial randomized approximation 
scheme (fpras) was constructed for the Ising partition function, as well as the mean energy 
and magnetization. An fpras II26II23II21II48I for an Ising observable Xq p jj is a randomized 
algorithm which, given as input a problem instance (G, j5 , h) and real numbers e, r; 6 (0,1], 
outputs a random number Y satisfying 


IP[(1 - £)Xc.fi,h <Y< {\+e)Xg,pm] > 1 - 


(1.5) 


in a time which is at most a polynomial in |y|, and Notably, despite the fact that the 
JSI process is not irreducible when h = 0, the fprases constructed in lf23l are valid for all h > 
0, including = 0. In |j39), a fully-polynomial approximate generator from the measure 113 
is described, which uses successive calls of the partition function fpras presented in II23I 
to generate approximate samples from the Fortuin-Kasteleyn measure, from which Ising 
samples can then be generated using the Edwards-Sokal coupling fl4l . 

In the present article, we study the mixing of the worm process. Like the JSI process, 
the worm process is based on the high temperature expansion of the Ising model. However, 
the key difference between the two approaches is that while l23l considered only strictly 
positive fields in their high temperature expansions, ll38ll considered only the case of strictly 
zero field. As a consequence, while the configuration space of the measure considered in II23I 
consists of the full edge space of G, the configuration space introduced in consists of 
spanning subgraphs subject to strong constraints on the vertex degrees. While this may at 
first sight seem to be a disadvantage, the worm process provides a simple and natural method 
of sampling from this non-trivial space of combinatorial objects, and gives rise to particu¬ 
larly natural estimators for the Ising susceptibility, and two-point correlation function. 

The worm process was first introduced in the context of quantum Monte Carlo in II37I . 
and classical versions were subsequently described in f38l . In O, a numerical study of the 
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worm process concluded that it is currently the most efficient method known for estimat¬ 
ing the susceptibility of the three-dimensional Ising model. Numerical evidence presented 
in ^54]| also suggests it provides a very efficient method for studying the Ising two-point cor¬ 
relation function. Applications and extensions of the worm process now constitute an active 

1^ . To our knowledge, however, no rigorous results have previously been reported on the 
mixing of the worm process. 

In the current article, we prove that the worm process for the zero-held Ising model is 
rapidly mixing on all connected graphs, at all temperatures. As a corollary, we show that 
the standard worm estimators for the susceptibility and two-point correlation functions used 
by computational physicists, which are simply sample means of natural observables, dehne 
fully-polynomial randomized approximation schemes for these quantities. In the latter case, 
we restrict attention to the correlations between pairs of vertices whose graph distance is 
bounded above by some hxed distance k eN. 


1.1 Outline 

Let us outline the remainder of this article. Section fL^ establishes some preliminary notation 
and terminology that we shall use throughout. Our main results on the relaxation and mixing 
times are then stated in Section o In Section we introduce and study the Prokof’ev- 
Svistunov (PS) measure, and describe its relationship to the Ising model. The PS measure 
is the stationary distribution of the worm process, whose definition we give in Section [2^ 
The proof of Theorem 1 1.3 1 is presented in Section[^ Finally, Sectionj^describes how to use 
the mixing time bound to construct fully-polynomial randomized approximation schemes 
for the Ising susceptibility and two-point correlations. 


1.2 Preliminaries 

For a finite graph G = {V,E), we set n = |y| and m = \E\, and denote the maximum degree 
by A. For any pair of vertices m, v 6 V, we letd(M, v) denote the graph distance between them. 
To avoid trivialities, we shall always assume m > 1. For simplicity, as the underlying graph 
G can be considered fixed throughout, we suppress explicit mention of G in our notation. 

We shall be interested in certain random spanning subgraphs of G. To avoid confusion, 
we shall denote the empty set in 2^ by 0, as distinct from the empty set in the corresponding 
sigma algebra 2^ , which we simply denote 0. Since 2^ forms a vector space over Z 2 , in 
which 0 is the zero vector, this notation seems quite natural; see e.g. cni. 

If a vertex in a given graph has odd degree, then we shall call it an odd vertex. For A C E, 
the set of odd vertices in the spanning subgraph {V,A) will be denoted by dA. For a given 
set A C £, we let dA (v) be the degree of v £ P in the spanning subgraph (V,A), i.e. 

^^(v) := #{m £ y: MV £ A}. (1.6) 

For w £ N, we write [«] := {1,2,... ,w}. 

We emphasize that henceforth, in all that follows, we shall focus entirely on the case 
of zero field, in which we set h = 0 in ( |1.2[ l. In addition, since ( |l.l| l is trivial when the 
temperature is either zero or infinite, and to avoid trivial but tedious technicalities, we shall 
confine our attention at all times to the case 0 < j5 < °°. 
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1.3 Statement of Main Results 


Given an aperiodic and irreducible Markov chain with finite state space Q,, transition matrix 
P, and stationary distribution n, and given a prescribed 8 e (0,1), one defines (see e.g. 1291 
l2TlfTl l the mixing time from state co 6 to be 

tmix(®,5) :=min{t e N : \\P'{a,-)-7t\\ < 5}, (1.7) 

where || ■ || denotes total variation distance. One further defines := niaXfl,g ^2 tmix(®> ^)- 

Theorem 1.3. Consider the zero-field ferromagnetic Ising model on a finite connected graph 
at inverse temperature /3 > 0, and let x = tanhjS. The corresponding worm process, defined 
by the transition matrix ( |2.9[ l, satisfies: 

frei < AAmn^, 


and for any 5 G (0,1) 


fmix(0,5) <4 


.(5) <4 


log 2 + 


log 5" 


A n^. 


log 


log 25 


A m^ n^. 


As discussed in Section]^ the state 0 is the natural state in which to initialize the worm 
process, which explains the special treatment afforded it in Theorem |1. 3 1 


2 Definition of the PS measure and worm proeess 


2.1 High temperature expansions 


We begin by recalling the standard high-temperature expansion for the Ising correlation 
functions (see e.g. O), and note some simple consequences of it that will form key ingredi¬ 
ents in our proof Theorem |1. 3 1 We begin with some notation. 

Given x £ (0,1) and a finite graph G = {y,E), we define the following measure on 2^, 

A,(5):= 5C2^. (2.1) 

AeS 


We emphasize that while il.t(0) = 0, by contrast il,v(0) = 1. In addition, for any W C V we 
let 

^W={ACE:dA = W}, (2.2) 

and, in a slight abuse of notation, for any pair of vertices m , v we will write ^uv = ^{u.v} > 
for k G [n] we will write 

‘^k ■■= [j (2.3) 

wcv 

\W\=k 


Lemma 2.1. Consider a finite graph G = {V,E) and inverse temperature j5 > 0. Let Ep 
denote expectation with respect to the zero-field Ising measure on G, defined in and 

let Xx denote the corresponding measure on if with x = tanh(jS). Then, for any W C V, vre 
have 


% 


n 

iew 


Oi 


Xxjfw) 
Xx(‘^o) ■ 
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Proof. Begin by observing that for ct e {— 1,1 we have 

[cosh(/3) + OiOj sinh(/3)] 

ij£E ij^E 

= cosh"'(;S) P^[l + CT,(J;x] 

ij€E 

= cosh'"(jS) Y, n 

ACE ijCA 

= cosh'"(jS) 

ACE i^V 


This implies 


z/i= E 

tTG{-l.l}'' ijCE 

= cosh'”(/3)i: vl^lpi 

ace iGVc7,G{-l,+ l} 

= 2 " cosh"’(;S)A^(^o) 


since the sum L(T;6 {-i,+i} cjf equals 0 for k odd, and 2 for k even. Likewise, 


% 



1 


E n cTT 


P <7G{-1,1}^' Jew ijCE 

1 




1^1 


n E 

;gv\w (t,g{-i,+i} 



n E ^ 

jeW t7jG{-l,+l} 




^x{^w) 

XiM' 


□ 


Corollary 2.2. Consider a finite graph G 
k e [n] 

Av(%) - ’ 


{y,E) and x G (0,1). For any W CV, and any 

4(^o) “ W' 


Proof. Since (llyeiv < L we have E^g (riyGW ^j) ^ t>tid Lemma 2.1 then immediately 


implies the first stated result. To obtain the second result, we note that since fl = 0 
whenever W / W', the first result implies 


^xi%) ^v(‘^o) “ \kj ’ 


where denotes the set of all subsets of V of size k. 


□ 
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2.2 Prokof’ev-Svistunov measure 


One simple consequence of Lemma HD is that the variance of the Ising magnetization 
CT/ satisfies 


var^.^ = Y, %(o'«crv) 

u.veV 


nXxi^o) + 2Af ('^ 2 ) 

4(^o) 


(2.4) 


The susceptibility Xj} = -var^(.^) therefore satisfies 

_L ^ nXx{%) 
xp nA^(‘^o)+2 Aj:(^2) 

This motivates introducing the configuration space W = 0*^2, and the probability mea¬ 

sure Tlx defined by 


nA,('^o)+2A.(<^2)’ 


V/(A) := 



A £ 

Ae'^2. 


(2.6) 


Consideration of the probability space (1^, Kx) was first proposed in ll38]l . We refer to %x as 
the Prokof’ev-Svistunov (PS) measure. Several Ising observables can be expressed neatly in 
terms of the PS measure, including the susceptibility 


1 


and the two-point correlation function 


n ^xi^uv) 

2 ;r.(^o) ■ 


(2.7) 


(2.8) 


2.3 Worm process 

The worm process is a Markov chain on the state space W, constructed by metropolizing 
the following proposals with respect to the PS measure {2.6) . 

- If A £ '^ 0 : 

- choose a uniformly random vertex v £ P 

- choose a uniformly random neighbour m ~ v 

- propose A i-£ AAuv 

- If A £ ■^ 2 : 

- choose a uniformly random odd vertex v E dA 

- choose a uniformly random neighbour m ~ v 

- propose A 1 —^ AAuv 

Here AAuv denotes symmetric difference of A and the edge uv, i.e. if uv £ A we propose to 
delete it, while if mv 0 A we propose to add it. We begin by observing that these proposals 
are indeed well defined, in the sense that we necessarily have AAuv £ W. To see this, we 
first note the following elementary lemma. 
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Lemma 2.3. Consider a finite graph G = {V,E). IfA,B C E, then d{AAB) = {dA)A{dB). 


Proof. If V eV, then 

dAAB{v) = dA{v) + dB(v) - IdAnsiv). 

Therefore, dAAsiv) is odd iff dA{v) +dB{v) is odd. The latter holds iff either d'yi(v) is odd 
and dsiv) is even, or dA{v) is even and dsiv) is odd. Consequently, v £ d{AAB) iff either 
V E dA and v ^ dB, or v £ dB and v ^ dA. Therefore, v £ d{AAB) iff v £ {dA)A{dB). □ 

It follows from Lemma [23] that if A £ ■%, then 

d{AAuv) = ((9A)A(5{m, v}) = 0 A{m, v} = {m, v}, 

and so AAmv £ ‘^ 2 - Conversely, if A £ “^2 with dA = {u,w}, then Lemma |2^ implies 


d(AAuv) = ((9A)A((9{m, v}) = {u,w}A{u,v} 


j {w, v} wfiv, 
1 0 w = v, 


which, in turn, implies that either AAuv £ 'W 2 or AAmv £ Consequently, the worm pro¬ 
posals do indeed yield a well-defined transition matrix on W. 

To ensure the eigenvalues of the worm transition matrix are strictly positive, we consider 
a lazy version of the metropolized proposals. This means that at each step, with probability 
1/2 we send A 1 —>■ A, and with probability 1/2 we propose a non-trivial transition, which 
is then accepted with the appropriate Metropolis acceptance rate. The resulting transition 
matrix can then be described as follows. 


Px{A,AAuv) 


Huv^A] _ 


2n 




1 

4 


^d{u) d(v)^ 

1 1 \ 

d(u) d(v} J ’ 


A E ^0 


AAuv 6 ^0 


min ( ^ ^ <^ 2 , AAuv E^ 2 ,uE dA 

\ \d{v) J Ad(u) 

(2.9) 

All other non-diagonal entries of P^ are zero. We refer to Px with x = tanh jS as the worm 
process corresponding to the Ising model with inverse temperature fi. 

By construction, Px is lazy (and therefore aperiodic) and reversible with respect to TZx. 
Corollary |3.4| to be discussed in Section [^ shows that it is also irreducible. The laziness 
of Px ensures that r^i is simply the reciprocal of the difference between the two largest 
eigenvalues of Px- For later use, we note the following lower bound. 


Lemma 2.4. Consider a finite graph G = {V,E) andx £ (0,1). If A £ W and e E E, then 


^|AUe| 

V/(A)A,(A)P,(A,AAe) > 


2A 
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Proof. If A e *^0 or A Amv £ “Wq, then the result can be seen by inspection from ( |2.9| l. Suppose 
instead that both A and AAuv belong to ^ 2 , and that u 6 dA. Then 



jj.|AUhv| 


> 


2A 


□ 


2.4 Bounds on the PS measure 

We conclude this section with two lemmas concerning the PS measure. Lemma [23] is re¬ 
quired in our proof of Theorem |1.3| in Section]^ while Lemma jThj is required in our discus¬ 
sion of fprases in Section]^ 

Lemma 2.5. Consider a finite connected graph G= {V,E) andxCz (0,1). Then 7t.c{0) > 2^"', 
and for all A we have 



Proof. Let ;= + 2Ax(^2) denote the partition function of the PS measure. Then 

from Corollary|2.2|it follows that 



< n^X\{%) 
= n^2"'-"+^ 


To evaluate Ai (^ 0 ) we have used the fact that for any finite connected graph, the cardinality 
of the cycle space is 2'"^"+^ (see, e.g. uni)- 

By definition, ZxKx{A) equals either if A £ %j, or 2x^^^ if A £ '^ 2 - Therefore 


?t.r(0) 


n 


> 2-"’ 


> 2-"\ 


n 


Likewise, since Zx7tx{A) > 2vl'^l > 2x™ for all A £ W, we have 

2x^ 


- „22m-«+l 



1 /a:\'«2«+' 


□ 
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Lemma 2.6. The PS measure on finite graph G = {V,E) with parameter x G (0,1) satisfies 

n 

for all M, V G y. 

Proof. We begin with the bound for ;r^(‘^o)- Using Corollary |2.2| we obtain 

1 _ n?ix{^o)+2?ix{^2) _ 2 Xx{W 2 ) ^ 2 /n\ ^ 

n^x{%) n ?Ix{'^q) ~ n V2/ 

Likewise, Corollary |2.2| also implies 

1 _ n Xx{^o) ^ n AtC^o) /n\ Ax('^o) _ ^ 

Ttxi^,,) ~ 2 ^ Xxi^uxd - 2 Xxi^uv) U; U^^uv) ~ 2 Xxi^uv) ■ ^ ^ 

Now specify a shortest path pm, between u and v, and observe that 

\A/\puv\ > 1^1 - \Pm’\ = \A\-d{u,v). 

Lemma [2^ implies that the map a : 'Wuv —^defined by a[A) = AAp^, is a bijection. It 
follows that 

A(%)= E Y. (2.11) 

AE'^O AG^uv A^^iiv 

The stated result follows by combining ( |2.10| > and ( |2.11| >. □ 

Lemma 2.7. Consider a finite graph G = (V,E), and a set VL C V. IfE G then the map 
(X : ^ defined by (x(A) = AAE is a bijection. 

Proof. Let A G '^o and set A' = AAE. Lemma [23] implies that 

dA' = {dA)A{dF) = dE = W, 

and so A' G ‘^vv- Since A = 0 !(A'), this implies that a is surjective. 

Now suppose that a(A) = a(A') for A,A' G ThenAAF = A' AE. But taking sym¬ 

metric difference of both sides with F immediately implies A = A'. Therefore a is injec¬ 
tive. □ 


3 Proof of Rapid Mixing 

Consider an irreducible and reversible Markov chain, with finite state space Q , transition 
matrix P, and stationary distribution n. Let = {Q.,S’p) denote the transition graph of P, 
where = {(A,A') G : P{A,A') > 0}. It is natural to consider l^p to be a directed graph, 
but we note that since P is assumed reversible, the edges of f^p occur in anti-parallel pairs. 
To avoid confusion between i^p and the underlying graph G on which we define the Ising 
model, we shall always refer to the elements of (Pp as transitions, and reserve the word edge 
for elements of the edge set of G. Our proof of Theorem [T3] makes essential use of the 
following result 1401 Corollary 3]. A similar result was proved in i24l . 
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Theorem 3.1 (Schweinsberg (2002) l40l ). Consider an irreducible and lazy Markov chain, 
with finite state space D. and transition matrix P, which is reversible with respect to the 
distribution Tt. Let ,5^ C LI be nonempty, and for each pair {1,F) £ 12 x J/’, specify a path 
Ji f in l^pfrom I to F. Let 

r = : (/,F) 6 12 X .5^} 

denote the collection of all such paths, and let -^{F) be the length of a longest path in F. 
For any transition T, let 


= {(/,F) £ 12 X .5^ ; yip 3 J}. 


Then 

frei < 4^(r)(p(r) 


where 


(p(F) := max • 

{A,A')eSp 


n{I)n{F) 




n{5T) n(A)P{A,A') 


Remark 3.2. We note that h40\l defines the relaxation time to be the reciprocal of the spec¬ 
tral gap, rather than the reciprocal of the absolute spectral gap, as considered here. For this 
reason, our statement of Theorem \3.1\ includes the added condition that the chains be lazy. 

In the context of the worm process, it is convenient to choose = '^q. This allows the 
construction of a natural choice of F, leading to the following result. 

Lemma 3.3. There exists a choice of paths F = {yi.F : {1 ,F) (z W x %)} such that 
(p(F)<An^ and -S?(L) < m. 

Corollary 3.4. The worm process on graph G with parameter x £ (0,1) is irreducible. 


Proof. Since the worm process is reversible, it suffices to show that it is possible to transition 
from an arbitrary state 7 £ to the fixed state 0, in a finite number of steps, with positive 
probability. Let //o denote the path from 7 to 0 described in Lemma 3.3 Since FL'fiF) < m, 
7/ 0 corresponds to a finite sequence of transitions. By construction, this sequence occurs 
with positive probability. □ 


Before we prove Lemma [L3j we use it to prove our main result. 

Proof of Theorem \L3\ Combining Lemma [T^ with Theorem |3. 1 [ immediately implies 

Lei<4zlmn^, (3.1) 


as required. To bound the mixing time, we appeal to the following general bound 




log 


1 

nfA) 


+ log 


Lei; 


(3.2) 


which holds for any irreducible and reversible finite Markov chain (see e.g. ED). Insert¬ 
ing l|3.1|> together with Lemma 2.5 into l|3.2|i, then yields the stated bounds for fmix(0,5) and 


tmix(5). □ 

It now remains only to prove Lemma 
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1 F 


Fig. 1 Example of a transition sequence Yi f. We order the vertices from left to right, and bottom to top. 
lAF = BqUBi, where the path Bq consists of the single edge V2V5, and the cycle B[ is V4V5V8V7V4. 


Proof of Lemma We begin by specifying a candidate set of paths, F, and then go on 

to bound <p(F). To this end, fix an [n]-valued vertex labeling of G. The labeling induces 
a lexicographical total order of the edges, which in turn induces a lexicographical total 
order on the set of all subgraphs of G. For each cycle, we additionally fix an orientation 
by demanding that from the lowest labeled vertex in the cycle we move to the lowest labeled 
of its neighbors. 

In order for the worm process to transition from I to F 6 '^ 0 , it suffices that it 
updates, precisely once, those edges in G which lie in the symmetric difference /AF. Since 
F e 'Wq, we have dF = 0, and so Lemmaimplies d{IAF) = dL Suppose that 7 £ ^2 
with dl = {m,v}, so that d(IAF) = As the sum of the degrees of each connected 

component of {V,IAF) must be even, u and v must belong to the same component. Of all 
the shortest paths in {V,IAF) between u and v, let Bq denote the edge set of the path which 
appears first in our fixed subgraph ordering for G. Now observe that lAF \ TJq £ ‘^o- Every 
element of can be decomposed into a (possibly empty) disjoint union of the edge sets 
of cycles in G (see e.g. (H §1 .9]). Decompose /AF \ Bq in this way, order the resulting 
cycles, and denote them by 5i,52,...,Fi.. Using this prescription, we therefore obtain a 
unique disjoint partition /AF = where Bq is a path and 5,- is a cycle for i £ [L]. 

We now define Yi,f as follows. The initial state in y/.f is I. Beginning from either u or 

V, according to which has the lowest label, unwind the path Bq. During this unwinding, the 
occupation status of each edge in Bq is inverted, in the order in which it occurs in Bq. This 
produces a sequence of states in W , the first being /, and the last being IABq, with each 
pair of consecutive states differing by a single edge in Bq. Next, unwind the Bi in order, for 
!= 1,2,in each case beginning with the lowest labeled edge and proceeding according 
to the fixed cycle orientation. After having unwound B^, we obtain a sequence of states in 

W, the first being I and the last being /A(uf^QS,) = F, with each pair of consecutive states 
in the sequence differing by a single distinct edge in /AF. Let y/.f denote this sequence 
of states. If the initial state I lies in %q rather than ‘^ 2 , the above construction still holds, 
but with Bq = 0, in which case one begins yj f by unwinding Si. Fig. [T] illustrates a simple 
example of a transition sequence Yi.f ■ 

We emphasize that each Yff constructed according to the above prescription is in fact a 
path in the transition graph. Indeed, from i |2.9[ l we see that the transition probability from any 
state in Yi.f to the next state in Yi.f is at least x/{nA). We have therefore constructed a path 
Yi.f from any 1 eW to any F £ ‘^q. Let F = {yi.f ■ {I,F) x ‘Wq} denote the collection 
of all such paths. Since each edge in G is updated at most once during the traversal of Yi.f, 
we have .Sf(r) < m. 

Given this choice of F, we must now bound <p(F). Define 


^T,k:={(fF)e^/cXWQ:Yi.F3T}, 
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and := ^r,o U - Following arguments similar to those in I22II21L for each transition 
T = {A,A') 6 we now introduce a map r\j : ^ U “^2 U “^4 via 


r\T{I,F) := /AFA(AUA'). 


Since T is of the form T = (A,AAe) for some e £ we have r\j{I,F) = /AFA(AUe). 
We claim that for each T 6 the map 777 is an injection. To show this, we demonstrate 
how to reconstruct 7 and F given rjjiljF). The first observation is that we can recover 
lAF by simply using lAF = r 77 (/,F)A(AUe). From our fixed graph labeling, we can then 
immediately infer the order in which the edges in lAF must be unwound when travers¬ 
ing the transition path 7 / 7 . Given this information, we can then begin in state A and un¬ 
wind the remaining edges in lAF specified by Yi.f, to recover F. We then recover I via 
I = riT{I,F)A(AUe)AF, and so rjT is indeed an injection. 

Next, we prove that for any T 6 we can bound the summand appearing in (p(F) by 


1 n^{I)Tt^{F) 

Ttx{A)Px{A,A') 


< ^^V/(/)A,(r77(/,F)). 


(3.3) 


Let T = (A, AAe) for some A^W and e E E. From Lemma 2.4 and the fact that F £ ' 
we have 


1 Kx(I)Kx{F) 
7^x{%) Kx{A)Px{A,A') 


< 


1 W)^x(.I)nXx{F) 
nXxi^o) w{A)Xx{A)Px{A,A') 


2A 

Xx[^q) 




2A 

Xx{%)) 


^(/);,l>lr(/T)l, 


The final equality is a consequence of the following elementary set-theoretic observation, 
which was utilized in an analogous context in t23\ . Let B be any subset of E satisfying 
ir\F C7JC/UF, and set U = lAFAB. It follows that U ClB = I ClF and UUB = lUF, 
and so the inclusion-exclusion principle yields 


|/|-b|F| = |/UF| + |/nF| = |7/UF|-f |7/nB| = \U\ + \B\. (3.4) 


Now, by definition, the path 7 / 7 modifies only the edges in lAF. Therefore, no edges in 
7 n F are updated when traversing Yi.f ■ Conversely, the only edges which can be updated 
in traversing Yff are those belonging to 7UF. It follows that AUe satisfies the constraint 
7nFCAUeC7UF. Choosing 5 = AUe and U = riT{l,F) in ( |3.4| l then yields 

|7| -I- |F| - |AUe| = \riT(I,F)\, 


which establishes (|3.3^. 
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Now let T = (A,AAe) e Sp^ be a maximally congested transition. It follows from l|3.3| 


that 


9(0 < E 

{l,F)e^T 




2A 

Ar('^o) 

2A 

XaW) 


E «A,(r]H/,0)+ E ^UnT{I,F)) 

iLF)e£^T,0 {I,F)e&>T,2 

[nXx{r\T{3^Tfi)) +'^^x{j\T{i^T,2))\ , 


(3.5) 


where the second equality follows from the fact that qj- is an injection, and T]T{^T,k) de¬ 
notes the image of the set I^T,k under the map rjj. By assumption, for any {I,F) e ^^T,k 
we have F e %. Since A U e £ W, Lemma 2.3 implies that for any (/jF) £ 3^T,k, the state 
riT{I,F) belongs to W U'^4. Moreover, if (/,F) £ ^pfl, then in fact riT{I,F) £ W. There¬ 
fore, T]r(<^r,o) ^ and r]T{^T.2) ^ ^ 13*^4, and it follows from Corollary |2.2| that 

9{F) T7c^ [’^^x{%) U ‘^ 2 ) +'2?ix{%) U “^2 U '^4)] 


= 2A 


< 2A 


\ , -,xAx('^2) „AA‘^4) 


(u -f 2) -|- (n + 2) j j -|- 2 


< Aw"^. 


□ 


4 Fully-polynomial randomized approximation schemes 

The rapid mixing of the worm process allows us to construct an efficient randomized ap¬ 
proximation scheme for the Ising susceptibility, as well as the correlation between the spins 
on any two sites whose distance is not more than some fixed value k. These schemes, which 
simply involve burning in, and then computing sample means of certain natural random 
variables, coincide exactly with what a computational physicist would do in practice. 

Before constructing these schemes, we address the issue of how to initialize a worm 
process. A cold start of a Markov chain refers to starting the process in a fixed initial state. 
Since the state 0 £ 1^ maximizes ?r, (A), it is the natural choice of initial state in a cold start 
of the worm process. Moreover, since directly constructing arbitrary elements of is a 
non-trivial task, initializing a worm process via a more general distribution on W is unlikely 
to be practical in actual simulations. Indeed, perhaps the simplest way to generate arbitrary 
elements of is via worm proposals, starting from 0. Therefore, in this section, we assume 
the worm process is started in the fixed state 0. 

We introduce some terminology which will prove convenient below. Consider a positive 
quantity jc > 0. We say another quantity x estimates x with relative error e if |i — x| < ex. 
In the case where x is random, we say that x provides an (e, 5)-approximation for x if the 
probability that x approximates x with relative error e is at least 1 — 5. In this language, 
an fpras for an Ising observable is a randomized algorithm which, for arbitrary choices of 
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(e, 5), provides an (e, 5)-approximation for the given observable, and which runs in a time 
polynomial in e^*, 5^^ and n. 

To establish that our worm estimators yield fprases, we require an appropriate concentra¬ 
tion result for the worm process. There are a number of Chernoff-type bounds available for 
finite Markov chains II17II30I . For our purposes, however, it is convenient to instead use II29I 
Theorem 12.19], which is obtained by bounding the mean square error and then applying 
Chebyshev’s inequality. Combining this with Theorem |1. 3 1 we obtain the following. 

Lemma 4.1. Let be a worm process on finite connected graph G with parameter 

X £ (0,1) andX q = 0. Let f :W ^ 1R+, and £,5 E If 


T = 


N> 




16e ^5 ^Amn^ 


E..(/) 


then 


1 t+N 

/(X,)-E^,(/) 


N 


t=z+\ 


>£^nAf) < 5 . 


(4.1) 

(4.2) 

(4.3) 


In particular, the worm process provides an [e, 5)-approximation ofEji^i^f) in time of order 


N T = O { A mn 


m + e 


2s-l 


E%/. 


Proof. The stated result follows almost immediately from Theorem 1 1.3 [ and ll29l Theorem 
12.19]. Although the latter theorem is stated in a form which is uniform over all possible 
initial states used in a cold start, the proof given in |(29l actually establishes the following 
slightly sharper result, in which the dependence on the initial state is explicit. Specifically, 

if T > tmix(0,5/2) and A> then 


Fo 


s=0 




<5. 


Since / is assumed positive, we have var^j,. (/) < ||/||ooE;i:,. (/). The stated result then follows 
from Theorem 1 1.3 1 □ 


We note that there is a simple and standard method, often referred to as the median trick. 
to replace the linear dependence of 5^* in the running time of the method in Lemma 


4.1 


with a logarithmic dependence. Set 5 = 1/4, and choose T and N as specified in Lemma 4.1 


Use the worm process to generate k independent estimates Y\,.. .,Yi_ of E^j^ (/) with these 
parameters. Choose k = 6[lgT)^*] -|- 1, and let Y be the median of this set of estimates. 
Then 12511231 we have 

¥o{\Y-E„J\ > eE^J) < rj. 


The choice of 5 = 1/4 is arbitrary; choosing any other fixed 5 £ (0,1 /2) produces a similar 
result. The constants in the bound can be sharpened somewhat by choosing 5 more carefully; 
see t36i for a discussion. 
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4.1 Susceptibility 


In this section, we apply Lemma 4.1 to the estimation of the susceptibility. Let e e (0,1). 


We begin with the elementary and general observation that if a quantity x estimates a quan¬ 
tity X with relative error e/(l -|- e), then 1 /i estimates 1 jx with relative error e. If we con¬ 
struct a quantity Sq which estimates with relative error e/(l -|-e), it therefore follows 

from \2.1) that 1/So estimates Xj 3 with relative error e. 

Let (X,),gN be a worm process on finite connected graph G with parameter x = tanh(;S) 
and Xq = 0. Since Lemma[23]implies 




< n, 


setting / = l<gjj and e i—^ e/( 1 -|- e) in Lemma 4.1 shows that if 


with T given by (|4.1|l and 


j t+N 

E W) 

f=r+l 


Af=|'l6e ^(l-l-e)^5 


(4.4) 


(4.5) 


then 1/So provides an (s, 5)-approximation of Xp in time Al-|- T = 0(5 *e This 

procedure therefore defines an fpras for xp ■ 


4.2 Correlation function 


Lemma [4T| can also be applied to the two-point correlations. In general, if quantities x and y 
estimate quantities x and y with relative error e/(2 -|- e), then for any positive constant c, the 
quantity cx/y estimates cx/y with relative error e. If So and Suv respectively estimate %(%) 
and Kx^^uv), each with relative error e/(2 -|- e), it therefore follows from (|2.8|l that 


n Sux 

2^ 


(4.6) 


estimates with relative error e. 

Let (26),gN be a worm process on a finite connected graph G = {V,E) with parameter 
X = tanh(jS) and Xq = 0. We can again estimate 7c(‘^o) with So given by \AA) , and we can 
likewise estimate with 


1 T+A^ 

Su.-.= - 1%,(X,). (4.7) 

f=T+l 

We emphasize that So and S„,, can both be computed from the same realization of (26),gN. If 
we demand that So and S„,, respectively estimate 7Zx{'^o) and Kx{‘^uv) with probability at least 
1 — 5/2, then the union bound guarantees that \A.6) estimates E,,,.(cTmCTi.) with probability at 
least 1 — 5. 

Now fix k e N. Lemma [2(6| implies that for any u,v eV with d{u,v) < k, we have 

ll^vlU ^ 1 < 
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Therefore, choosing in l|4.7|i 


N = 


[l6e^2(2 + e)25^'4 



(4.8) 


and T as in 1 |4.1| ) with 5^5/2, implies that Suv provides an (e/(2 + e), 5/2)-approximation 
for nx[‘^uv)- Since ( |4.8| l is strictly larger than ( |4.5| l, using this same choice of N and T in l |4.4| i 
also implies that Sq provides an (e/(2 + e), 5/2)-approximation for Kx{‘^q). Therefore, with 
this choice of N and T, the estimator ( |4.6| l provides an (e,5)-approximation for E;i^(ct„CTv,) 
in time A^ + T = 0(e^^5^^A mn^x^^). 

Finally, let us define the k-restricted Ising two-point correlation function 


gk : {(m,v) 6 V X y : d{u,v) < A:} —s- M 


by gk{u,v) It follows that, for any fixed £ N, the above construction provides 

an fpras for the problem of computing 
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